Visual body pose analysis for human-computer interaction

نویسنده

  • Michael Van den Bergh
چکیده

Human-Computer Interaction (HCI) is the study of interaction between people (users) and computers. The recent advances in computing technology push the interest in human-computer interaction in other ways than the traditional keyboard, mouse or keypad devices. The work presented in this thesis uses computer vision to enhance the HCI, by introducing novel real-time and marker-less gesture and body movement-based systems. Real-time systems have a high refresh rate and minimal latency, providing the user with smooth and instantaneous interaction with the system. Marker-less systems allow a natural interaction without wearing special markers or special tracking suits, which are generally required in modern day tracking systems. The systems described in this thesis aim to achieve this real-time marker-less HCI. They are based on vision and built with standard computers equipped with standard color cameras. The goal set for this work is hand gesture-based interaction with large displays, as well as full body pose recognition for interaction where the user is immersed in a virtual environment. The systems described in this thesis can be divided into three components: (1) preparing the input for detection and recognition, which includes segmentation and reconstruction; (2) detecting of body parts and recognition of body poses and hand gestures; (3) using the detection/recognition to steer the application. These three components are reflected in chapters 2 to 4 of this thesis. Segmentation and Reconstruction. The first part of the thesis provides a brief summary of foreground-background segmentation, skin color segmentation and 3D hull reconstruction. Skin color 4 segmentation usually suffers from changes in lighting and of the user. Therefore, a novel and improved skin color segmentation algorithm is introduced, which combines an offline and an online model. The online skin color model is updated at run-time based on color information taken from the face region of the user. Detection and Recognition. The second part of the thesis starts with a summary of how the face, eye, hand and finger locations can be detected in a camera image. Then, a novel body pose recognition system is introduced based on Linear Discriminant Analysis (LDA) and Average Neighborhood Margin Maximiza-tion (ANMM). This system is able to classify poses based either on 2D silhouettes or 3D hulls. Using a similar technique, a novel hand gesture recognition system is introduced. Both the body pose and hand gesture recognition systems are improved for speed with the help of Haarlets. A novel Haarlet training algorithm is …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

تخمین چنددوربینی حالت سه بعدی انسان با برازش افکنش مدل اسکلت سه بعدی مفصل دار در تصاویر سایه نما

Automatic capture and analysis of human motion, based on images or video is important issue in computer vision due to the vast number of applications in animation, surveillance, biomechanics, Human Computer Interaction, entertainment and game industry. In these applications, it is clear that 3D human pose estimation is an essential part. Therefore, its accuracy has a great effect on the perform...

متن کامل

Construction and Analysis of Tissue-Specific Protein-Protein Interaction Networks in Humans

We have studied the changes in protein-protein interaction network of 38 different tissues of the human body. 123 gene expression samples from these tissues were used to construct human protein-protein interaction network. This network is then pruned using the gene expression samples of each tissue to construct different protein-protein interaction networks corresponding to different studied ti...

متن کامل

‘vVISWa’ – A Multilingual Multi-Pose Audio Visual Database for Robust Human Computer Interaction

Automatic Speech Recognition (ASR) by machine is an attractive research topic in signal processing domain and has attracted many researchers to contribute in this area of signal processing and pattern recognition. In recent year, there have been many advances in automatic speech reading system with the inclusion of audio and visual speech features to recognize words under noisy conditions. The ...

متن کامل

Visual Estimation of Attentive Cues in HRI: The Case of Torso and Head Pose

Capturing visual human-centered information is a fundamental input source for effective and successful human-robot interaction (HRI) in dynamic multi-party social settings. Torso and head pose, as forms of nonverbal communication, support the derivation people’s focus of attention, a key variable in the analysis of human behaviour in HRI paradigms encompassing social aspects. Towards this goal,...

متن کامل

Machine Learning for Image Based Motion Capture

Image based motion capture is a problem that has recently gained a lot of attention in the domain of understanding human motion in computer vision. The problem involves estimating the 3D configurations of a human body from a set of images and has applications that include human computer interaction, smart surveillance, video analysis and animation. This thesis takes a machine learning based app...

متن کامل

Human 3D Pose Estimation and Activity Recognition from Multi-View Videos: Comparative Explorations of Recent Developments

This paper presents a review and comparative study of recent multi-view approaches for human 3D pose estimation and activity recognition. We discuss the application domain of human pose estimation and activity recognition and the associated requirements, covering: advanced Human-Computer Interaction (HCI), assisted living, gesture-based interactive games, intelligent driver assistance systems, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010